Unmonitored Legacy Data Identification — Project Mockup

Project Overview

Project Name: Unmonitored Legacy Data Identification
Technologies: Python, ML, Data Mining, Cloud
Problem Statement: Legacy data is often unmanaged, risking compliance breaches
AI Component: Data classification and anomaly detection
Solution: AI scans legacy datasets and flags them for security and compliance review
Impact: Reduces data risk and improves global compliance efficiency

Sample Dataset — Legacy File Inventory

File ID  File Name                  Size (MB)  Created On  Last Accessed  Storage Path
F001     employee_records_1998.xls  12.5       1998-06-12  2007-05-10     /archive/hr/1990s/
F002     old_vendor_contracts.zip   87.3       2005-09-01  2011-12-24     /archive/legal/contracts/
F003     financial_audit_2001.pdf   4.1        2002-03-17  2004-08-20     /archive/finance/audit/
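
The inventory above could be screened for long-idle files with a few lines of Python. This is a minimal sketch, not the project's implementation: the LegacyFile class, the stale_files helper, the reference date 2024-06-01, and the 15-year cutoff are all assumptions introduced for illustration. The reference date was picked only because it reproduces the 17-year figure quoted for F001 in the anomaly flags.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class LegacyFile:
    """One row of the legacy file inventory (hypothetical structure)."""
    file_id: str
    name: str
    size_mb: float
    created_on: date
    last_accessed: date
    storage_path: str

    def idle_years(self, as_of: date) -> int:
        """Whole years elapsed between the last access and `as_of`."""
        years = as_of.year - self.last_accessed.year
        # Subtract one if the anniversary has not yet been reached this year.
        if (as_of.month, as_of.day) < (self.last_accessed.month,
                                       self.last_accessed.day):
            years -= 1
        return years


# The three sample rows from the inventory table.
INVENTORY = [
    LegacyFile("F001", "employee_records_1998.xls", 12.5,
               date(1998, 6, 12), date(2007, 5, 10), "/archive/hr/1990s/"),
    LegacyFile("F002", "old_vendor_contracts.zip", 87.3,
               date(2005, 9, 1), date(2011, 12, 24), "/archive/legal/contracts/"),
    LegacyFile("F003", "financial_audit_2001.pdf", 4.1,
               date(2002, 3, 17), date(2004, 8, 20), "/archive/finance/audit/"),
]


def stale_files(files, as_of, min_idle_years):
    """Return the files not accessed for at least `min_idle_years` years."""
    return [f for f in files if f.idle_years(as_of) >= min_idle_years]


# Assumed reference date and cutoff; both are illustrative only.
AS_OF = date(2024, 6, 1)
stale_ids = [f.file_id for f in stale_files(INVENTORY, AS_OF, 15)]
```

Under these assumptions, stale_ids contains F001 and F003; F002 was last accessed more recently and falls below the cutoff.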

Sample Dataset — AI Classification Summary

File ID  Detected Category    Sensitivity Level  PII Detected?  Retention Risk
F001     Employee HR Records  High               Yes            Critical
F002     Vendor Contracts     Medium             No             Moderate
F003     Financial Reports    High               No             High
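
To make the classification step concrete, the sketch below maps a storage path to a category and sensitivity level. It is a rule-based stand-in, not the ML classifier the project describes: the RULES table, the classify_path function, and the "Unclassified" fallback are hypothetical, and the keyword-to-category pairs are simply read off the three sample rows. PII detection and retention risk are left out because they would require inspecting file contents.

```python
from typing import NamedTuple


class Classification(NamedTuple):
    category: str
    sensitivity: str


# Hypothetical keyword rules derived from the three sample rows only.
# A real system would replace this table with a trained classifier.
RULES = [
    ("/hr/", Classification("Employee HR Records", "High")),
    ("/legal/", Classification("Vendor Contracts", "Medium")),
    ("/finance/", Classification("Financial Reports", "High")),
]

# Fallback for paths that match no rule (an assumption, not in the mockup).
UNKNOWN = Classification("Unclassified", "Unknown")


def classify_path(storage_path: str) -> Classification:
    """Return the first rule whose keyword appears in the storage path."""
    path = storage_path.lower()
    for keyword, result in RULES:
        if keyword in path:
            return result
    return UNKNOWN


hr = classify_path("/archive/hr/1990s/")
```

Here hr carries the same category and sensitivity as row F001. Path keywords are a weak signal on their own, so a fuller version would presumably combine them with content features.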

Sample Dataset — ML Anomaly Detection Flags

Alert ID  File ID  Anomaly Type                                    AI Confidence  Recommended Action
A001      F001     Contains PII but unmonitored for 17 years       0.97           Immediate Review
A002      F002     Contract expiry mismatch                        0.82           Legal Validation
A003      F003     High-risk category stored in unsecured archive  0.91           Move to Secure Storage
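
Two of the flags above (A001 and A003) follow simple conditions that can be written as plain rules. The sketch below is illustrative only: both function names and the 10-year threshold are assumptions, and it produces no confidence scores, since those would come from a trained model. A002 is not covered because the sample data contains no contract expiry dates.

```python
from typing import Optional

# Hypothetical cutoff: how long PII may sit unmonitored before escalation.
PII_IDLE_THRESHOLD_YEARS = 10


def flag_pii_unmonitored(pii_detected: bool,
                         years_unmonitored: int,
                         threshold: int = PII_IDLE_THRESHOLD_YEARS) -> Optional[str]:
    """Rule behind A001: PII present and unmonitored past the threshold."""
    if pii_detected and years_unmonitored >= threshold:
        return "Immediate Review"
    return None


def flag_unsecured_high_risk(sensitivity: str,
                             in_secure_storage: bool) -> Optional[str]:
    """Rule behind A003: high-sensitivity data outside secure storage."""
    if sensitivity == "High" and not in_secure_storage:
        return "Move to Secure Storage"
    return None


# F001: PII detected, unmonitored for 17 years (figures from alert A001).
action_f001 = flag_pii_unmonitored(True, 17)
# F003: high sensitivity, stored in an unsecured archive (from alert A003).
action_f003 = flag_unsecured_high_risk("High", False)
```

Each rule returns the recommended action when it fires and None otherwise, so the results can be collected per file and passed to a review queue.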